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Abstract 

We characterize all quasiperiodic Sturmian words: a Sturmian word is not quasiperiodic if 
and only if it is a Lyndon word. Moreover, we study links between Sturmian morphisms and 
quasiperiodicity. 
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1 Introduction 

The notion of repetition in Strings is central in a lot of researches, in particular in Combinatorics 
on Words and in Text Algorithms (see for instance [H], ^H] for recent surveys). In this vein, 
Apostolico and Ehrenfeucht introduced the notion of quasiperiodic finite words [2] in the following 
way: "a string w is quasiperiodic if there is a second string u ^ w such that every position of w 
falls within some occurrence of u in w". The reader can consult [X] for a short survey of studies 
concerning quasiperiodicity. In |12j . Marcus extends this notion to right infinite words and he opens 
six questions. Four of them are answered in [Jj. 

One of these six questions is: does there exist a non-quasiperiodic Sturmian word? In [Jj, we 
provide an example of such a word, but this positive answer is not completely satisfying. Since a 
first feeling can be that there exists no (or at most very few) such word, one can ask for a complete 
characterization of such non-quasiperiodic Sturmian words. After some preliminaries in Sections 2, 
3 and 4, we provide two answers described below. 

Sturmian words have been widely studied because of their many beautiful properties and links 
with many fields (see Chapter 2] for a recent survey). One aspect of these words is that they 
can be infinitely decomposed over the four morphisms L a , Lb, R a and Rb (see Section [31 for more 
details). The first characterization of non-quasiperiodic Sturmian words proposed in this paper is 
based on such a decomposition. More precisely, Theorem 15.61 states that a Sturmian word is not 
quasiperiodic if and only if it can be decomposed infinitely over {L a , Rb} or infinitely over {Lb, R a }- 

Our second characterization ( Theorem 16.5)1 provides a more semantic answer: a Sturmian word 
is not quasiperiodic if and only if it is an infinite Lyndon word. 

The proof of our first result uses the fact that some morphisms obtained by compositions of 
the morphisms L a , Lb, R a and Rb map any infinite words into a quasiperiodic one. We call such a 
morphism strongly quasiperiodic. In Section 7, we characterize the Sturmian morphisms which are 
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strongly quasiperiodic. Let us quote that any Sturmian morphism / is quasiperiodic, that is there 
exists a non-quasiperiodic word w whose image by / is quasiperiodic. 

2 Generalities 

We assume the reader is familiar with combinatorics on words and morphisms (see, e.g., [HI El)- We 
precise our notations. 

Given a set X of words (for instance an alphabet A, that is a non-empty finite set of letters), 
X* (resp. X u ) is the set of all finite (resp. infinite) words that can be obtained by concatenating 
words of X. The empty word e belongs to X* . The length of a word w is denoted by \w\. By \w\ a 
we denote the number of occurrences of the letter a in w. A finite word u is a factor of a finite or 
infinite word w if there exist words p and s such that w = pus. If p = s (resp. s = e), u is a prefix 
(resp. suffix) of w. A word u is a border of a word w if u is both a prefix and a suffix of w. A factor 
u of a word w is said proper if w ^ u. 

Given an alphabet A, a(n endo) morphism f on A is an application from A* to A* such that 
f(uv) = f(u)f(v) for any words u, v over A. A morphism on A is entirely defined by the images 
of letters of A. All morphisms considered in this paper will be non-erasing: the image of any non- 
empty word is never empty. The image of an infinite word is thus infinite and naturally obtained as 
the infinite concatenation of the images of the letters of the word. In what follows, we will denote 
the composition of morphisms by juxtaposition as for concatenation of words. Given a set X of 
morphisms, we will also note X* the set of all finite compositions of morphisms of X and X u the 

set of all infinite decompositions of morphisms of X. When a word w is equal to lim /1/2 • ■ ■ fn(a), 

n—>oo 

fi G A, we will say that w can be decomposed (infinitely) over X. 

Given a morphism /, powers of / are defined inductively by / = Id (the Identity morphism), 
P = f P^ 1 for integers i > 1. When for a letter a, /(a) = ax with i/e, the morphism / is said 
prolongable on a. In this case, for all n > 0, f n (a) is a prefix of f n+1 (a). If moreover, for all n > 0, 
|/ n (a)| < |/ n+1 (a)|, the limit lim f n {a) is the infinite word denoted f u (a) having all the f n {a) as 
prefixes. This limit is also a fixed point of /. 

3 Sturmian words and morphisms 

Sturmian words may be defined in many equivalent ways (see chapter 2] for instance). They are 
infinite binary words. Here we first consider them as the infinite balanced non ultimately periodic 
words. We recall that a (finite or infinite) word w over {a, b} is balanced if for any factors u and v 
of same length ||u| — |u| | < 1, and that an infinite word w is ultimately periodic if w = uv u for 
some finite words u and v. 

Many studies of Sturmian words use Sturmian morphisms, that is morphisms that map any 
Sturmian word into a Sturmian word. Seebold fTj\ proved that the set of these morphisms is 
{E, L a , Lb, R a , Rb}* where E, L a , Lb, R a , Rb are the morphisms defined by 

E \ a^b ^ I awd ^ \a<-^ba^ \a^a ^ j a 1— ► ab 
' 1 b 1— ► a, a ' 1 b 1— > ab, 1 b 1— ► b, a ' 1 b <— > ba, | b 1— ► b. 

Many relations exist between Sturmian words and Sturmian morphisms. For instance, recently 
the following result was proved: 
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Theorem 3.1 jS] Any Sturmian word w over {a, b} admits a unique representation of the form 
w = lira L^-^R^L^-^Rl 2 . . . Lf^-^"- 1 R^- 1 L d b 2 n ~ C2n Rf n {a) 
where df. > Ck > for all integer k > 1, > 1 for k > 2 and if Ck = d^ then c^-i = 0. 

Remark: Let us mention that this representation is not expressed as in p] where it is written 

w = T C1 L dl T C2 Lf 2 T° 3 if 3 T C4 Lf 4 . . . 

where T is the shift map defined, for any infinite word (a n ) n >o with a n letter for any n > 0, by 
T(a n ) n >o = (o n +i) n >o. One can verify that for integers c, d such that d > c > and for any infinite 
word w, T c L d a {w) = L d a - c R c a {w) and T c L d (w) = L d - c R c b (w). This explains the links between the 
two representations. The interested reader will also find relations between this representation and 
the notion of S-adic systems defined by Ferenczi jg] as minimal dynamical systems generated by a 
finite number of substitutions. 

A particular well-known family of Sturmian words is the set of standard (or characteristic) 
Sturmian words. It corresponds to the case where for each k > 0, = 0. Hence any standard 
Sturmian word admits a unique representation on the form: 

w = lim L dl L? t 2 L d3 L d h 4 . . . L^- 1 L d h 2n (a) 

n^oo a b a b a b \ I 

where d\ > and d^ > 1 for all k > 2. 

To end this section, we recall useful relations between Sturmian morphisms. 

Theorem 3.2 jH] (see also for a generalization) The monoid {L a , Lb, R a , Rb, E}* of Sturmian 
morphisms has the following presentation: 

(1) EE = Id, 

(2) EL a = L b E and ER a = R b E, 

(3) L a LlR a = R a RlL a , for any n > 0. 

Note that from (2) and (3), we get: L^L^Rb = RbR^Lb for any n > 0. 

4 Word quasiperiodicity and morphisms 

In this paper, we consider mainly infinite quasiperiodic words. However we first recall the notion of 
finite quasiperiodic words to allow us some comparisons. 

We consider definitions from PJ. A word u covers another word w if for every i € {1, . . . ,\w\}, 
there exists j E {1, . . . , \z\} such that there is an occurrence of u starting at position i — j + 1 in 
the word w. When u ^ w, we say that u is a quasiperiod of w and that w is quasiperiodic. A word 
is superprimitive if it is not quasiperiodic (Marcus ^2] calls minimal such words). One can observe 
that any word of length 1 is not quasiperiodic. The word 

w = abaababaabaababaaba 

has aba, abaaba, abaababaaba as quasiperiods. Only aba is superprimitive. More generally in PJ, it 
is proved that any quasiperiodic finite word has exactly one superprimitive quasiperiod. This is a 
consequence of the fact that any quasiperiod of a finite word w is a proper border of w. 

3/01 



When denning infinite quasiperiodic words, instead of considering the starting indices of the 
occurrences of a quasiperiod, for convenience, we choose to consider the words preceding the oc- 
currences of a quasiperiod. An infinite word w is quasiperiodic if there exist a finite word u and 
words (p n )n>o such that Po = £ and, for n > 0, < |p n +i| — \Pn\ — M and p n u is a prefix of w. 
We say that u covers w, or that w is u- quasiperiodic. The word u is also called a quasiperiod and 
we say that the sequence (p n u )n>o is a covering sequence of prefixes of the word w. The reader will 
find several examples of infinite quasiperiodic words in dUE|. Let us mention for instance that the 
well-known Fibonacci word, the fixed point of the morphism ip: a ab, b \— ► a is a&a-quasiperodic. 

It is interesting to note that <^ w (a) has an infinity of superprimitive quasiperiods (see [2| for a 
characterization of all quasiperiods of ip UJ (a)). This shows a great difference between quasiperiodic 
finite words and quasiperiodic infinite words. The reader can also note that for any positive integer 
n, there exists an infinite word having exactly n quasiperiods (as for example the word (ab) n a(ab) tJ )) , 
or having exactly n superprimitive quasiperiods [Zj. 

To end this section, let us observe that any quasiperiod of a (finite of infinite) quasiperiodic word 
w is a prefix of w. Hence w has a unique quasiperiod of smallest length that we call the smallest 
quasiperiod of w. When w is finite, the smallest quasiperiod of w is necessarily its superprimitive 
quasiperiod. When w is infinite, its smallest quasiperiod is also superprimitive, but there can exist 
other superprimitive quasiperiods (see above). 

Moreover: 

Lemma 4.1 If w is an infinite quasiperiodic word with smallest quasiperiod u, then uu is a factor 
ofw. 

Proof. If uu is not a factor of w then the prefix v of u of length \u\ — 1 is a quasiperiod of w. This 
is not possible if u is the smallest quasiperiod. □ 

Let us observe that Lemma l4~T1 is not true for finite words as shown by the a6a-quasiperiodic 
word ababa. 

In the following we will also use the immediate following fact: 

Fact 4.2 If w is a (finite or infinite) u- quasiperiodic word and f is a non-erasing morphism, then 
f(w) is f (u) -quasiperiodic. 

5 Sturmian non-quasiperiodic words 

In this section, we prove our main result (Theorem 15 .6p which is a characterization of all non- 
quasiperiodic Sturmian words. Before this, we prove several useful results. 

Let w be a Sturmian word. Denoting by n the least number of a between two consecutive b 
in w and by i the initial number of a in w, we can deduce from the balance property of w that 
w belongs to a % {ba n , 6a n+1 } w . When < i < n, w belongs to {a l ba n ~\ a t ba n+l ^ % } u) and w is 
a*6a n_l+1 -quasiperiodic (and a l ba n ~ t+1 is the smallest quasiperiod of w). Thus: 

Fact 5.1 If w is a non-quasiperiodic Sturmian word, then there exists an integer n such that w 
belongs to a n+1 b{a n b, a n+1 b}" U {ba n , ba n+1 } w . 
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Of course some Sturmian words in a n+1 b{a n b,a n+1 b} UJ U {ba n , 6a" +1 } w are quasiperiodic: it is 
the case of the image of any quasiperiodic Sturmian word starting with a by the Sturmian morphism 
L n a R h : a ^ a n+1 b,b^a n b. 

A consequence of Fact 15. II is: 

Lemma 5.2 For all Sturmian word w and x G {a,b}, L x R x {w) = R x L x (w) is quasiperiodic. 

Proof. Without loss of generality, assume x = a. From Theorem IM.21 L a R a = R a L a . Let us 
recall that L a R a (a) = a and L a R a (b) = aba. From Fact 14.21 if w is a quasiperiodic word, then 
L a R a {w) is quasiperiodic. Assume now that w is a Sturmian non-quasiperiodic word. By Fact 15. H 
w belongs to a n+1 b{a n b, a n+1 b} UJ U {ba n ,ba n+1 } UJ for an integer n. Hence L a R a (w) belongs to one 
of the sets a n+1 aba{a n aba,a n+1 aba} UJ or {abaa n , abaa n+1 } w . So L a R a (w) is a n+2 6a-quasiperiodic 
or a6a n+2 -quasiperiodic. □ 

Let us observe that ba^ and L a R a {ba^) = aba u are not quasiperiodic. This shows that Lemma l5~2l 
is not true for arbitrary words (even if they are balanced), unlike the next fact which is a direct 
consequence of the definition of L a Li,: a i— ► aba, b i— > ab, and LbL a : a i— » ba, b i— > bob. 

Fact 5.3 For any infinite word w, L a Lb(w) is aba- quasiperiodic and Li,L a (w) is bab- quasiperiodic. 

Lemma l5~2l a,nd Fact 15 .31 will be useful to prove that our condition in Theorem 15. 6l is necessary. To 
show it is sufficient, we now consider situations where the image of a word by a Sturmian morphism 
is not necessarily quasiperiodic. 

Lemma 5.4 Let x G {a, b} and let w be a balanced word starting with x. The word L x {w) is 
quasiperiodic if and only if w is quasiperiodic. Moreover in this case, the smallest quasiperiod of 
L x (w) is the word L x (v) where v is the smallest quasiperiod of w. 

Proof. Without loss of generality, we consider here that x = a. 
From Fact 14 .'2\ if w is quasiperiodic then L a (w) is quasiperiodic. 

From now on we assume that L a (w) is n-quasiperiodic where u is the smallest quasiperiod of 
L a {w). If w has at most one occurence of 6, then w = o? or w = a n ba u ' for an integer n > 0. Since 
L a (w) is quasiperiodic, we have w = a u and we verify that the smallest quasiperiod of it; and L a {w) 
is a = L a (a). From now on we assume that w contains at least two occurrences of the letter b. 
Denoting by n the least number of a between two consecutive occurrences of b in w and by i the 
number of a before the first b, since w is balanced, w G a l {ba n , 6a n+1 } w and < i < n + 1. 

If < i < n, then w and L a (w) are quasiperiodic with respective smallest quasiperiod a l ba n ~ l+1 
<mda i+1 ba n - i+l = L a {a i ba n - i+1 ). 

By hypothesis, w starts with a, so we cannot have i = 0. 

In the case i = n + 1: w G a n+1 b{a n b, a n+1 b} w and L a (w) G a n+2 {ba n+1 , ba n+2 } u . Since u is a 
quasiperiod of L a (w), u is a prefix of L a (w) and starts with a n+2 b. By Lemma l4~T| uu is a factor 
of L a (w). It follows that u ends with b and u = L a (v) for a word v G {a n b, a n+1 b}*. Now we prove 
that v is a quasiperiod of w. Let (pku)k>o be a covering sequence of L a {w) (po = e and for all 
k > 0, pku is a prefix of L a (w) and |pfc+i| — \pk\ < M)- Since u starts with a n+2 b, for each k > 0, 
there exists a word p' k such that p^ = L a (p' k ). Of course, p' Q = e. Since v G {a n b,a n+1 b}* , we can 
deduce for each k > that is a prefix of w. If for a k, \p' k+1 \ — \p' k \ > \v\, then p' k+1 = p' k vy for 
a word y and consequently Pk+i = PkuL a (y) which contradicts the fact that \pk+i \ — \Pk\ < \u\. So 
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for each k > 0, \p' k+ i\ — \p' k \ < \v\. We have shown that {p' k v) k >o is a covering sequence of w, so u 
is a quasiperiod of w. Assume w has a quasiperiod v' strictly smaller than v. Both v and v' are 
prefixes of w, so v = v's for a non-empty word s. Then |L a (f')| = |-L a (i>)| ~~ l-^a( s )l < l^a(^)| and 
L a {v') is a quasiperiod of L a {w) strictly smaller than u = L a (v). This contradicts the definition of 
u, so v is the smallest quasiperiod of w. 

□ 

Lemma 5.5 Let x,y be letters such that {x, y} = {a, b} and let w be a word starting with x. 
The word R y {w) is quasiperiodic if and only if w is quasiperiodic. Moreover when these words are 
quasiperiodic, the smallest quasiperiod of R y {w) is the word R y (v) where v is the smallest quasiperiod 
ofw. 

Proof. Without loss of generality, we consider here that x = a and y = b. 
From Fact 14 .2| if w is quasiperiodic then Rb(w) is quasiperiodic. 

Assume now that Rb(w) is quasiperiodic and let u be its smallest quasiperiod. By hypothesis, 
w starts with a, so does u. Since aa is not a factor of Rb{w) whereas by Lemma l4~T1 uu is a factor 
of Rb{w), we deduce that u ends with b. Thus there exists a word v such that u = Rb(v). As done 
in the proof of Lemma IQ1 for the case w G a n+1 {ba n , ba n+l } u) , we can show that v is a quasiperiod 
of u and more precisely that it is its smallest quasiperiod. □ 

The reader can observe one difference between the two previous lemmas: Lemma IQl considers 
only balanced words when Lemma 15.51 works with arbitrary words (starting with x). Note that 
Lemma IB~4l becomes false if we do not consider balanced words. Indeed the word w = abab^aaab) 1 ^ 
is not quasiperiodic, whereas L a (w) = aabaabaafaabaa)^ is aa&aa-quasiperiodic. The two lemmas 
become also false if we consider Sturmian words starting with y where {x, y} = {a, b}. Indeed, let us 
consider the case x = a, y = b: it is known [Jj that the word w = (LbR a ) u '(a) is not quasiperiodic; 
this Sturmian word starts with b and the word L a (w) (resp. Rb(w)) is afra-quasiperiodic (resp. 
frafc-quasiperiodic) . 

We can now establish the announced characterization of non-quasiperiodic Sturmian words. 

Theorem 5.6 A Sturmian word w is not quasiperiodic if and only if it can be infinitely decomposed 
over {L a , Rb} or over {Lb, R a }- Ln other words a Sturmian word w is not quasiperiodic if and only 
if 

w = Ym^L^RfLtR^ . . . L^R^{a) 

or 

w = Ym^LtRtLfRt . . . L%»-* B** (a) 
where d k > 1 for all k > 2 and d\ > 0. 

Proof. We first show that the condition is necessary. Let w be a non-quasiperiodic Sturmian word. 
By Theorem 13. 11 

w = Km lft-^BgLf-^Rg . . . L d ^- c ^R c ^L d h ^- C2n R c ^{a) 

where d k > c& > for all integer k > 1, d k > 1 for k > 2 and if c k = (4 then c k -\ = 0. 

By Lemma l5~2| for x £ {a,b} and any Sturmian word, L x R x (w) is quasiperiodic. By Fact 14.21 
this implies that for all k > 1, c k = d k or c k = 0. 
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Assume that Ck = and c^+i = for an integer k > 1. Then w = fL a Lb{w') or w = fLbL a (w') 
for a Sturmian word w' and a morphism /. By Fact 15.31 w is quasiperiodic. So for each fc > 1, 
c fc = implies c fc+ i = d k+1 . 

We know that for each k > 2, = (4 implies Cfc_i = 0. This is equivalent to say that for each 
k > 1, Cfc ^ implies Cfc +1 7^ dfc+i. But there for each /c, = c4 or = 0. Thus = c4 implies 
c fe+i = 0, the condition is necessary. 

Let us now show that any Sturmian word w that can be decomposed infinitely over {L a , R^} is 
not quasiperiodic (case {Lb,R a } is similar). Assume by contradiction that it is not the case. Let <S 
be the set of all Sturmian words w that can be decomposed over {L a , Rb} and that are quasiperiodic. 
Let u be a quasiperiod of smallest length among all quasiperiods of words in <S, and let w be an 
element of S having u as quasiperiod. By definition, w = L a (w') or w = Rb(w') for a word w' in S. 
Since ds 7^ 0, w starts with the letter a. By Lemmas 15.41 and 15 .5( u = L a {y) or u = Rb(v) where v 
is the smallest quasiperiod of w' . Since a u and are not Sturmian words (they are balanced but 
not ultimately quasiperiodic), \v\ a ^ and \v\b 7^ 0. Consequently \v\ < \u\. This contradicts the 
choice of u. Hence S is empty. □ 

Given a word w, let us denote X(w) the set of infinite words having the same set of factors than 
w. X(w) is invariant by the shift operator and is called the subshift associated with w. When w is 
Sturmian, it is known (see p]) that a word w' belongs to X(w) if and only if it is Sturmian and the 
associated sequence (dk)k>o in its decomposition of Theorem 13 .11 is the same as the one involved in 
the decomposition of w. 

To end this section, we observe that any standard Sturmian word (that is a Sturmian word 
that can be decomposed using only L a and Lb) is necessarily quasiperiodic. This gives a new 
proof of a result by T. Monteil ^SIEJ: any Sturmian subshift contains a quasiperiodic word (let us 
mention that the resutl of T. Monteil is more precisely: any Sturmian subshift contains a multiscaled 
quasiperiodic word, that is a word having an infinity of quasiperiods). The interested reader will find 
materials in Sectional to show that any standard Sturmian word has an infinity of quasiperiods (see 
Lemma l7.5|) . Theorem 15.61 also shows that in any Sturmian subshift, there is a non-quasiperiodic 
word. 

6 A connection with Lyndon words 

The aim of this short section is to give another characterization of non-quasiperiodic Sturmian words 
related to Lyndon words (see Theorem 16 .51 belowl . 

Let us recall notions on finite |Hj and infinite Lyndon words . We call suffix of an infinite 
word w any word w' such that w = uw' for a given word u. When u 7^ e, we say that w' is a proper 
suffix of w. This definition allows us to adopt the same definition for finite and infinite Lyndon 
word. Let ■< be a total order on A (in what follows, {a ~< b} denotes the alphabet {a, b} with a -< b). 
This order can be extended into the lexicographic order on words over A. A (finite or infinite) word 
over (A, ^) is a Lyndon word if and only if w is strictly smaller than all its proper suffixes. Any 
infinite Lyndon word has infinitely many prefixes that are (finite) Lyndon words (and so an infinite 
Lyndon word can be viewed as a limit of these prefixes). The following basic property of finite 
Lyndon words was pointed out by J. P. Duval (see Acknowledgments): 

Fact 6.1 Any finite Lyndon word is unbordered, that is the only borders of a Lyndon word w are e 
and w. 
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This allows us to state a relation between infinite Lyndon words and non-quasiperiodic infinite 
words (cf Corollary 16. 3|) . 

Fact 6.2 If w is an infinite u-quasiperiodic word, then any prefix of w of length at least \u\ + 1 is 
not unbordered. 

Proof. If p is a prefix of w of length at least \u\ + 1, then p has for suffix a prefix s of u (of length 
at most \u\). Since u is a prefix of w, u is also a prefix of p, and so s is a border of p. q 

Corollary 6.3 Any Lyndon word is not quasiperiodic. 

Our main Theorem l6.5l is a direct consequence of this corollary and the following characterization. 
Following [J5I we sa y that a morphism / preserves (finite) Lyndon words if for any (finite) Lyndon 
word u, f{u) is also a Lyndon word. We have: 

Proposition 6.4 |16j A Sturmian morphism f preserves Lyndon words over {a -< b} if and only if 

fe{L a ,R b }*. 

Theorem 6.5 A Sturmian word w over {a,b} is non-quasiperiodic if and only if w is an infinite 
Lyndon word over {a -< b} or over {b -< a}. 

Proof. Let w be a Sturmian word. By corollary 16 .'A\ if w is an infinite Lyndon word then w is not 
quasiperiodic. 

Assume now that w is not quasiperiodic. By Theorem 15. 6| w = lim L^Rf 2 . . . L^ 2 "" 1 R d2n (a) or 

w = lim L^Rf . . . L d2n ~ 1 R d a 2n (a) for some integers (4) fc >i such that d k > 1 for all k > 2 and di > 

0. Proposition 16 .41 implies that, since a is a Lyndon word, for each n > 1, L dl R^ 2 . . . La 2 "' 1 R^ 2n (a) 
is a Lyndon word over a -< b and L^R^ 2 . . . L^ 2 " -1 R d a 2n (a) is a Lyndon word over b -< a. Hence w 
is an infinite Lyndon word over a -< b or over b -< a. rj 

To end this section we study the converse of Corollary 16 ..SI and Fact 16.21 

The converse of Corollary 16.31 is not true in general. For instance we can consider any Sturmian 
word w over {a, b} and the word p = ababaaa. Then pw is not quasiperiodic since p is not balanced 
and so not a factor of w. Moreover, since p starts with the letter a, pw cannot be a Lyndon word 
if b ~< a. It is neither a Lyndon word if a -< b since for any prefix p' of w, aaap' -< w. 

The converse of Fact 16. 21 is also false: let w be an infinite word and p be an integer, if all prefixes 
of w of length greater than p + 1 are unbordered, then w is not necessarily quasiperiodic. To prove 
this, it is sufficient to consider the word w = aba^ . 

A more complex but interesting example, pointed out by P. Seebold (see Aknowledgements) , 
is the well-known Thue-Morse word T, fixed point of the morphism \x such that /j,(a) = ab and 
fi(b) = ba. The word T starts with abb and any prefix of length at least 4 ends with o, ab or abb. 
But T is not quasiperiodic: indeed it is well-known that T is overlap-free (a word is overlap-free 
if it contains no factor of the form xuxux where x is a letter, or equivalently it contains no factor 
that can be written both pv and vs with \p\ < \v\) and we can observe that: 

Fact 6.6 An overlap-free infinite word is never quasiperiodic. 



8/0! 



Proof. Let w be a u-quasiperiodic infinite word and let (p n u) n >o be a covering sequence of w. 
If there exists n > such that |p n +l| ~~ \Pn\ < M> then p n +iu = p n us for a word s such that 
s = |Pn+l| ~~ \Pn\ < \u\. Hence there exists a word p such that us = pu, then w is not overlap-free. 
If for all n > we have |p n +i| ~ \Pn\ = \u\, then w = u u is also not overlap-free. □ 

Finally let us mention that this fact is not valid for finite words since there exist some overlap-free 
words that are square (see ^H], cf. also for a characterization of such words). 

7 Sturmian morphisms and quasiperiodicity 

We say that a morphism / is quasiperiod-free if for any non-quasiperiodic word w, f(w) is also non- 
quasiperiodic. A non-quasiperiod-free morphism will just be called quasiperiodic. Let us observe 
that all Sturmian morphisms (except E and Id) are quasiperiodic. To verify it, it is sufficient to 
show that L a , Lb, R a and Rb are quasiperiodic. For L a and R a (case Lb and Rb are similar) we 
have: aba^ and ab^ are non-quasiperiodic although L a (aba UJ ) = abafab)^ and R a (ab u) ) = 0(60)^ are 
a&a-quasiperiodic. 

In the previous section, we encounter (Lemma PS] and Fa.ct l5.Hjl two different kinds of Sturmian 
morphisms. The morphism L a Lb maps any word into a quasiperiodic one, whereas there exists a 
non-quasiperiodic word w such that L a R a {w) is not quasiperiodic. Generalizing these two examples 
we observe that the set of quasiperiodic morphisms can be partitioned using the following notions: 

1. A morphism / on A is called strongly quasiperiodic (resp. on a subset X of A^) if for each 
non-quasiperiodic infinite word w (resp. w € X), f(w) is quasiperiodic. 

2. A morphism / on A is called weakly quasiperiodic (resp. on a subset X of A u ) if there exist 
two non-quasiperiodic infinite words w, w' (resp. w, w' G X) such that f(w) is quasiperiodic, 
and f{w') is non-quasiperiodic. 

The aim of this section is to answer the two following questions: 

• Which are the strongly (resp. weakly) quasiperiodic Sturmian morphisms? 

• Which are the strongly (resp. weakly) quasiperiodic Sturmian morphisms on (the set of) 
Sturmian words? 

We note that the two questions have different answers. Indeed L a R a as shown by Lemma l5~2*l is 
strongly quasiperiodic on Sturmian words, but as already said, L a i? a (6a w ) is not quasiperiodic. Of 
course, any strongly quasiperiodic Sturmian morphism is strongly quasiperiodic on Sturmian words, 
or equivalently (since a Sturmian morphism is quasiperiodic), any weakly quasiperiodic Sturmian 
morphism on Sturmian words is weakly quasiperiodic. 

7.1 A property of strongly quasiperiodic morphisms 

Before going further, we mention the following immediate result: 

Lemma 7.1 Let f be a morphism. If there exist morphisms f\, fi, such that f = /1/2/3 and 
such that f'2 is strongly quasiperiodic, then f is strongly quasiperiodic. 
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We observe that (quite naturally) Lemma 17. II becomes false when replacing strongly quasiperi- 
odic by weakly quasiperiodic. For instance, taking f\ = Id, f 2 = L a and fs = L b , we have f 2 
weakly quasiperiodic and /1/2/3 strongly quasiperiodic. There are cases where we can have f 2 
weakly quasiperiodic and j 1/2/3 quasiperiod-free, but this is not possible when f\, f 2 and fa are 
Sturmian morphisms since all Sturmian morphisms are quasiperiodic. To give an example of such 
a case, we need the following result: 

Lemma 7.2 The morphism g defined by g(a) = abab and g(b) = aaaa is a quasiperiod-free mor- 
phism. 

Proof. Let w be an infinite word such that g(w) is quasiperiodic. We show that w is also 
quasiperiodic. Let u be the smallest quasiperiod of g(w). Since u is a prefix of g(w), u = g(v)p for 
a proper prefix p of g{a) = abab or of g(b) = aaaa: p € {e, a, aa, aaa, ab, aba}. First we observe that 
if a or b does not occur in w, then w is quasiperiodic. From now on we assume that both a and b 
occur in w. Consequently \v\ a ^ and \v\ b 7^ 0. It follows that g(v) starts with (ab) 2n aaaa for an 
integer n > and with a 4m abab for an integer m > 0: of course m = or n = 0. Moreover g{v) 
ends with aaaa{ab) 2n for an integer n' > and with ababa 4m for an integer m' > 0: once again 
m' = or n' = 0. By Lemma [4. 1| uu is a factor of g(w). We then deduce that p = e since for all the 
other potential values, none of the words in {aaaa(ab) 2n ,ababa 4m }p{(ab) 2n aaaa, a 4m abab} could 
be a factor of g{w). Let (piu)i>o be a covering sequence of prefixes of g(w). As done in the proof of 
Lemma we can find a covering sequence (p;f)z>o of prefixes of w: the word v is a quasiperiod 
of w. □ 

Now let us consider the morphisms f\ = Id, f 2 = L a , and ^3 defined by /3(a) = 66, ^3(6) = aaaa. 
By the previous lemma /1/2/3 = g is quasiperiod-free whereas f 2 is weakly quasiperiodic. 

To end this section, we let the reader verify that fs is quasiperiod-free and more generally that 
any morphism h defined by h(a) = a 1 , h{b) = 6 ? with i > 1 and j > 1 is quasiperiod-free. 

7.2 Weakly and strongly quasiperiodic Sturmian morphisms 

In this section, we characterize weakly quasiperiodic Sturmian morphisms. (Equivalently this char- 
acterizes strongly quasiperiodic Sturmian morphisms since any Sturmian morphism is weakly or 
strongly quasiperiodic.) 

Proposition 7.3 A Sturmian morphism is weakly quasiperiodic if and only if it belongs to the set 

{E, Id}{L a ,R b }*{L a , R a }* U {E, Id}{L b , R a }*{L b , R b }* . 

The proof, given at the end of the section, is a consequence of the next lemmas. 

Lemma 7.4 Let f be a morphism in {L a , L b , R a , R b }* different from the identity. The morphism 
f belongs to {L a , R b }*{L a , R a }* U {L b , R a }*{L b , R b }* if and only if f cannot be written f = /1/2/3 
with fi, fs € {L a , L b , R a , R b }* and f 2 verifying one of the four following properties: 

1- /2 S L a {L a , L b , R a , R b }* L b U L b {L a ,L b ,R a ,R b }*L a , or 

2. h = R a gL a with g £ {R a ,L a }* or f 2 = R b gL b with g £ {R b ,L b }*, or 

3. h G RaB+Ra or f 2 G R b R+R b , or 

10/0! 



4. h 6 RiL+R b = L+R+R b or f 2 E itf = L+R+R a . 

Proof. First we let the reader verify using Theorem 13.21 that if / belongs to {L a , Rb}* {L a , R a }* U 
{Lb, R a }*{Lb, Rb}* then it cannot be written / = /1/2/3 with fi,f 2 ,f3 as in the lemma. 

From now on assume that / cannot be written / = /1/2/3 with /i,/2,/3 as in the lemma. Let 
gi, . . . ,g n (n > 1 since / is not the identity) in {L a , Lb, R a , Rb} such that / = g\ . . . g n . 

We first consider the case where g\ = L a . By Impossibility 1 for f 2 , for each i > 1, gi ^ Lb- If 
there exists an integer i > 1 such that gi = R a , then g\ ■ ■ .gi = hL a R l a or g± ... gi = hRbR l a for a 
morphism h and an integer I > 1. In the first case by Impossibility 4 for f 2 , for all integer j > i, 
fj ytz R b _ In the second case by Impossibilities 3 and 4 for f 2 , for all integer j > i, we also have 
fj + R b . Thus / E L a {R b ,L a }*{L a ,R a }*. 

Assume now the more general case (than g\ = L a ) where there exists an integer i > 1 such that 
gi = L a and gj 7^ L a for 1 < j < i (the first occurrence of L a appears at the position i), Samely as 
above, we show that g = g%. . .g n E L a {Rb, L a }*{L a , R a }* . By Impossibility 1 for f 2 , for each integer 
j, 1 < j < i, gj 7^ Lb. Thus gj E {R a ,Rb} for each 1 < j < i. We have three cases: If / E R* a g, 
then by Impossibility 4 for f 2 , we have / E L a {R b ,L a }*{L a ,R a }* U {i? a ,L a }*. If / E hR+R* a g for 
a morphism h E {i? a ,i?b}*, then by Impossibility 2 for /2, h € Rfc and so / E R^R*g; then by 
Impossibilities 3 and 4 for / 2 we have / E {L a , fi b }*{L a , -R a }*. If / E / E {L a , i? b }*{L a , fi a }*. 
So when there exists an integer i > 1 such that ^ = L a , / E {L a , i?ft}*{L a , R a }* . 

The case where there exists an integer i > 1 such that ft = leads similarly to / E 
{L 6 ,i4}*{L 6)J R 6 }*. 

Now we have to consider the case where for alH, 1 < i < n, gi $ {L a , Lb}. Then by Impossibility 
3 for f 2 , necessarily, / E R* a Rl U n 

Lemma 7.5 Every morphism f in L a {L a , Lb, R a , Rb}* Lb U Lb{L a , Lb, R a , Rb}*L a is strongly 
quasiperiodic. 

Proof. We only prove the result for / in L a {L a , Lb, R a , Rb}*L b (the other case is similar exchanging 
the roles of the letters a and b). Let / = La/1/2 • • • fnLb with n > and fi in {L a , Lb, R a , Rb} for all 
1 < i < n. We prove by induction on n that there exist morphisms g and h such that / = gL a Lbh 
(and so from Lemma f7.ll and Fact 15.31 / is strongly quasiperiodic). The property is immediate 
for n = 0. Assume now n > 1. If there exists i between 1 and n such that /» = L a or /j = 
we can apply the induction hypothesis and Lemma ITTTI to conclude. Now suppose that for all i, 
fi {L a -> Lb}. Three cases are possible: 

• if /1 = R a , since L a R a = R a L a from Theorem 13. 2| / = R a L a f\ . . . f n Lb and we conclude by 
the induction hypothesis. 

• If f n = Rb we can proceed similarly. 

• Assume now f\ = i? and f n = R a (this implies n > 2). Let j be the greatest integer 
(1 < j < n) such that fj = R b , Then / = L a f x ... f j ^ 1 R b R2~ i L b , and by Theorem E3 
/ = L a fi . . . fj_\LbL"^ 3 Rb. We conclude by the induction hypothesis. 

□ 

Remark: we could have used another approach observing that L a Rb(w) (L a Rb(a) = aab, 
L a Rb(b) = ab) is a6a-quasiperiodic for every infinite word w starting with b, and deducing that 
every morphism of the form L a RbfLb with / E {L a ,R a ,R }* is strongly quasiperiodic. 
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Lemma 7.6 Every morphism f = R a gL a with g $ {R a ,L a }* or f = RbgLb with g {Rb,Lb}* is 
strongly quasiperiodic. 

Proof. We only prove the first case, the other one is similar. Let g = g\ . . . g n (necessarily n > 1) 
such that g G" {R a ,L a }* and for each i between 1 and n, gi G {L a ,Lb,R a ,Rb}- If there exists an 
integer i such that gi = Lb then the result is immediate from Lemma f7.5l Consequently we consider 
that g G ({L a ,R a }*Rb) + {L a ,R a }* . Thus the morphism / can be decomposed / = f\hf 2 with h G 
R a L*R^ R*L a . If i, j > 0, k > 1 are the integers such that h = L l a R a R^L a R J a , Theorem 13.21 shows 
that h = L l a L a L^R a R J a . Consequently Lemmas 17, II and 17.51 imply that h is strongly quasiperiodic. 

□ 

Remark: here again we could have used another approach observing that R a Rb(w) (R a Rb(a) = 
aba, R a Rb(b) = ba) is afra-quasiperiodic for every infinite word w starting with a, and deducing that 
every morphism of the form R a RbfL a with / G {L a , R a , Rb}* is strongly quasiperiodic. 

This approach is used to prove: 

Lemma 7.7 Any morphism f in R a R^R a U RbR^Rb is strongly quasiperiodic. 

Proof. Let j > 1 be an integer such that / = R a R 3 b R a . Let w be a word. If w starts with b, 
RbR a {w) is 6a6-quasiperiodic, and so f(w) is quasiperiodic. If w starts with a, R^' 1 R a (w) also 
starts with a. Then R a RlR a (a) is afra-quasiperiodic. □ 

Lemma 7.8 Every morphism f in R^L^Rb = L^R^Rb or in R^L^R a = L^R^ R a is strongly 
quasiperiodic. 

Proof. Theorem E21 implies R+L+Rb = L+R+R b and R^L^Ra = L+R+R a . 

We prove only the first case, the other one is similar. Let n > 1. It is easy to see that R a L™Rb(w) 
(R a L2Rb(a) = aa n ba, R a L™Rb(b) = a n ba) is a n+1 6a-quasiperiodic if w starts with a, and is a n baa- 
quasiperiodic if w starts with b. By Lemma l7~Tl any morphism in R+L+Rb is quasiperiodic. rj 

Proof of Proposition 1 7. !A 

Prom Theorem 13.21 EL a = LbE and ER a = RbE, so any Sturmian morphism can be written fg 
with / G {Id, E} and g G {L a , Lb, R a , Rb}* ■ Thus Proposition 17. 31 is a consequence of the following 
one: a morphism / in {L a , Lb, R a , Rb}* is weakly quasiperiodic if and only if / belongs to the set 
X = {L a ,R b y{L a ,R a Y U {L b ,R a y{L b ,Rb}*. 

To prove this, assume first that / G {L a , Lb, R a , Rb}* is weakly quasiperiodic. By Lemma l7~T| 
this morphism cannot be written / = /1/2/3 with f 2 a strongly quasiperiodic morphism. Hence by 
Lemmas Ol ESI EEIO and EH1 / belongs to X. 

Assume now that / G X. Since / is Sturmian, it is quasiperiodic and so we just have to prove 
the existence of one word such that f(w) is not quasiperiodic. So we just have to prove the existence 
of one word w such that f(w) is not quasiperiodic. We do it for / G {L a , Rb}*{L a , R a }* (the other 
case is similar). There exist morphisms g G {L a , Rb}* and h G {L a , R a }* such that / = gh. We can 
verify that h{aba w ) = a n ba UJ for an integer n > 1, and so is a balanced word. By Lemmas 15.41 and 
15. 5| we thus deduce that {g{h{aba u ')) = f(aba UJ ) is not quasiperiodic. □ 
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7.3 Weakly Sturmian morphisms on Sturmian words 

Proposition 17.31 and Lemma 15.21 show that some morphisms, as for instance L a R a , are weakly 
quasiperiodic whereas they are strongly quasiperiodic on Sturmian words. This section allows us 
to characterize all these morphisms. Let us recall that since a Sturmian morphism is quasiperiodic, 
any Sturmian morphism is weakly or strongly quasiperiodic on Sturmian words. 

Proposition 7.9 A Sturmian morphism different from E and Id is weakly quasiperiodic on Stur- 
mian words if and only if it belongs to {E, Id}{L a , Rb}* U {E, Id}{Lb, R a }* ■ 

Proof. 

Let us make a preliminary remark: for any morphism /, / is weakly quasiperiodic on Sturmian 
words if and only if Ef is weakly quasiperiodic on Sturmian words (since for any word w, w is 
quasiperiodic if and only if E{w) is quasiperiodic). 

Assume first / G {E, Id}{L a , R b }*U{E, Id}{L b , R a }* ■ Without loss of generality, we can assume 
/ G {L a ,Rb}* U {Lb,R a }*- If / belongs to {L a ,Rb}* (resp. to {L&, R a }*), using Theorem 15.61 we 
observe that f{{L a R\,) u ) (resp. f ((L^Ra)^)) is not quasiperiodic. Since any Sturmian morphism is 
quasiperiodic, / is weakly quasiperiodic on Sturmian words. 

Now assume / is weakly quasiperiodic on Sturmian words. Observe that from Theorem 13.2( 2). 
/ G {E,Id}{L a ,Lb,R a ,Rb}* ■ Without loss of generality, from the preliminary remark, we can 
assume that / belongs to {L a , Lb, R a , Rb}* and prove that / G {L a ,Rb}* U {Lb,R a }*- By Propo- 
sition ESI / belongs to {L a , Rb}*{L a , R a }* U {Lb, R a }*{Lb, Rb}* ■ Assume by contradiction that 
/ {Lai Rb}* U {Lb, R a }* ■ One of the following four cases holds: 

1. / = gL a R a with g G {L a , R b }*{L a , R a }*; 

2. / = gR b Ri with g G {L a ,R h }*, i > 1; 

3. / = gL b R b with g G {L b , R a }*{L b , R b }*; 

4. / = gR a R b with g G {L b ,R a }*, i > 1. 

Case 1: Assume / = gL a R a and let w be a non-quasiperiodic Sturmian word. By Lemma \cT3\ 
f(w) is quasiperiodic. 

Case 2: Assume / = gRbR a and let w be a non-quasiperiodic Sturmian word. By The- 
orem |03 w can be decomposed over {L a ,Rb} or over {Lb,R a }- So f(w) = gRbR a L a {w') or 
f(w) = gRbR l a Rb{w') or f(w) = gRbR a +:J L b {w') for a (non-quasiperiodic) Sturmian word w' and 
an integer j > 0. Thus by Lemma l5~2*j Lemma ITTTI and Lemma I71H f(w) is quasiperiodic. 

Cases 3 and 4 are respectively similar to cases 1 and 2. In all cases, f(w) is quasiperiodic for 
any non-quasiperiodic Sturmian word w, and so for any Sturmian word (by Fact 14 .2|) . Thus / is 
strongly quasiperiodic on Sturmian words. This is a contradiction, so / G {L a ,Rb}* U {Lb,R a }*- 
□ 
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